Importing Store.csv into a PySpark DataFrame
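
A minimal sketch of the load step, assuming the file has a header row and that letting Spark infer the column types is acceptable. The helper name `load_store_csv`, the default path, and the app name in the usage comment are illustrative and not taken from the original notebook.

```python
def load_store_csv(spark, path="Store.csv"):
    """Read a CSV file into a PySpark DataFrame.

    `spark` is an existing SparkSession. The header and inferSchema
    options are assumptions about the file, not taken from the source.
    """
    return spark.read.csv(path, header=True, inferSchema=True)


# Typical use (requires PySpark):
#   from pyspark.sql import SparkSession
#   spark = SparkSession.builder.appName("store-forecast").getOrCreate()
#   store_df = load_store_csv(spark)
```

Passing the session in as an argument keeps the helper independent of how the session is created.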

Printing the schema of the PySpark DataFrame

Checking for Missing Values and Fixing Them
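
One possible way to do this check, sketched under assumptions: the source does not say how missing values were detected or filled, so the helper names and the fill strategy (a per-column constant passed to `DataFrame.fillna`) are illustrative only.

```python
def null_count_exprs(columns):
    """Build one Spark SQL expression per column that counts its NULLs."""
    return [f"sum(cast(`{c}` IS NULL AS int)) AS `{c}`" for c in columns]


def count_nulls(df):
    """Return a one-row DataFrame holding the NULL count of every column.

    Counts NULLs only; NaN values in floating-point columns are not included.
    """
    return df.selectExpr(*null_count_exprs(df.columns))


def fill_missing(df, fill_values):
    """Fill NULLs using a {column: replacement} mapping.

    The mapping is supplied by the caller; the values used in the
    original notebook are not known.
    """
    return df.fillna(fill_values)


# Typical use on a PySpark DataFrame `store_df`:
#   count_nulls(store_df).show()
#   store_df = fill_missing(store_df, {"some_column": 0})  # hypothetical column
```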

Outlier Analysis

Outliers are fixed by replacing each outlying value with the mean value.
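
The replacement described above can be sketched as follows. The source does not state how outliers were detected, so the 1.5 × IQR fence rule used here is an assumption, as are the function names. The mean is taken over the whole column, outliers included.

```python
def iqr_bounds(q1, q3, k=1.5):
    """Return the (lower, upper) fences of the k * IQR outlier rule."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def replace_outliers_with_mean(df, column, k=1.5):
    """Replace values of `column` outside the IQR fences with the column mean."""
    # Imported here so the pure helper above works without PySpark installed.
    from pyspark.sql import functions as F

    # relativeError=0.0 requests exact quantiles (can be costly on large data).
    q1, q3 = df.approxQuantile(column, [0.25, 0.75], 0.0)
    lower, upper = iqr_bounds(q1, q3, k)

    # Mean of the whole column, outliers included (an assumption).
    mean_value = df.agg(F.mean(column)).first()[0]

    is_outlier = (F.col(column) < lower) | (F.col(column) > upper)
    return df.withColumn(
        column,
        F.when(is_outlier, F.lit(mean_value)).otherwise(F.col(column)),
    )
```

NULL values are left untouched, because the outlier condition evaluates to NULL for them and `otherwise` keeps the original value.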

ML Pre-processing

ML Train/Test Data
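
A minimal sketch of a train/test split, assuming a chronological hold-out, which is the usual choice for forecasting. The source does not state which split was actually used, and the function name and the column name in the usage comment are hypothetical.

```python
def chronological_split(df, date_col, cutoff):
    """Split a DataFrame into rows before `cutoff` and rows on or after it.

    `cutoff` can be a datetime.date or a date string that Spark can
    compare with the values in `date_col`.
    """
    train = df.filter(df[date_col] < cutoff)
    test = df.filter(df[date_col] >= cutoff)
    return train, test


# Typical use on a PySpark DataFrame `store_df` (hypothetical column name):
#   from datetime import date
#   train_df, test_df = chronological_split(store_df, "Date", date(2015, 6, 1))
```

Splitting by date keeps every test row later than every training row, so the model is never evaluated on dates it has already seen.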

Prophet Model for Time Series Forecasting
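
A minimal sketch of the forecasting step, assuming daily data that has been converted to a pandas DataFrame, since Prophet runs in plain Python rather than on Spark. Prophet expects the history in two columns named `ds` (date) and `y` (value). The helper name, the source column names in the usage comment, and the 30-period horizon are hypothetical.

```python
def fit_and_forecast(model, history, periods):
    """Fit a Prophet-style model and forecast `periods` steps past the history.

    `model` is an unfitted Prophet instance and `history` is a pandas
    DataFrame with the columns `ds` and `y` that Prophet requires.
    """
    model.fit(history)
    # Extends the history dates by `periods` rows (daily frequency by default).
    future = model.make_future_dataframe(periods=periods)
    return model.predict(future)


# Typical use (requires the `prophet` package and pandas):
#   from prophet import Prophet
#   history = train_df.toPandas().rename(columns={"Date": "ds", "Sales": "y"})
#   forecast = fit_and_forecast(Prophet(), history, periods=30)
#   forecast[["ds", "yhat", "yhat_lower", "yhat_upper"]].tail()
```

Calling `toPandas()` collects the whole DataFrame on the driver, so this approach assumes the series fits in memory.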

As we can see, the model performs well. We implemented the Prophet model, which we had not tried before, in plain Python, because we could not identify any module for time series algorithms in the PySpark library.